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Abstract 



Background: Many biological systems are modeled qualitatively with discrete models, such as probabilistic 
Boolean networks, logical models, Petri nets, and agent-based models, with the goal to gain a better 
understanding of the system. The computational complexity to analyze the complete dynamics of these models 
grows exponentially in the number of variables, which impedes working with complex models. Although there 
exist sophisticated algorithms to determine the dynamics of discrete models, their implementations usually 
require labor-intensive formatting of the model formulation, and they are oftentimes not accessible to users 
without programming skills. Efficient analysis methods are needed that are accessible to modelers and easy to 
use. 

Method: By converting discrete models into algebraic models, tools from computational algebra can be used to 
analyze their dynamics. Specifically, we propose a method to identify attractors of a discrete model that is 
equivalent to solving a system of polynomial equations, a long-studied problem in computer algebra. 
Results: A method for efficiently identifying attractors, and the web-based tool Analysis of Dynamic Algebraic 
Models (ADAM), which provides this and other analysis methods for discrete models. ADAM converts several 



1 



discrete model types automatically into polynomial dynamical systems and analyzes their dynamics using tools 
from computer algebra. Based on extensive experimentation with both discrete models arising in systems 
biology and randomly generated networks, we found that the algebraic algorithms presented in this manuscript 
are fast for systems with the structure maintained by most biological systems, namely sparseness, i.e., while the 
number of nodes in a biological network may be quite large, each node is affected only by a small number of 
other nodes, and robustness, i.e., small number of attractors. For a large set of published complex discrete 
models, ADAM identified the attractors in less than one second. 

Conclusion: Discrete modeling techniques are a useful tool for analyzing complex biological systems and there is 
a need in the biological community for accessible efficient analysis tools. ADAM provides analysis methods 
based on mathematical algorithms as a web-based tool for several different input formats, and it makes analysis 
of complex models accessible to a larger community, as it is platform independent as a web-service and does not 
require understanding of the underlying mathematics. 



Background 

Mathematical modeling is a crucial tool in understanding the dynamic behavior of complex biological 
systems. Discrete models are now widely used for this purpose. Model types include (probabilistic) 
Boolean networks, logical networks, Petri nets, cellular automata, and agent-based (individual-based) 
models, to name the most commonly found ones fT^-W\. 

There are several existing software packages for simulation and analysis of discrete models. These include 
GINsim, BoolNet R, Snoopy, signaling Petri net-based simulator in the PathwayOracle toolkit, DDLab, 
GenYsis-P Toolbox , and BN/PBN Matlab Toolbox [7 - 14 . Each package has been developed to suit the 
needs of a particular community, and each package is designed to analyze a different model type. They are 
discussed in detail in Results and Discussion. 

Simulation or exhaustive enumeration of the state space is common practice to analyze discrete models, 
but it is limited by computational complexity, as the state spaces grows exponentially in the number of 
variables. When relying on simulation on a standard desktop computer, most software tools are limited to 
40 variables or less, i.e., 10^'^ states for Boolean systems. Complex models can be analyzed with SAT or 
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BDD-based (Binary Decision Diagram) algorithms, but these algorithms are usually not available as 
software tools for a broad range of users. Most implementations are platform specific and require a 
particular input format that is not used by other software tools. Changing a model to the required format 
is often a labor-intensive process. GINsim, a package designed for the analysis of gene regulatory networks, 
uses a method based on binary decision diagrams to efficiently analyze asynchronous logical models, and 
provides this method without requiring the user to understand the underlying algorithm, but GINsim is 
specific to logical models and cannot import other discrete modeling types or formats 7|. 
Here, we present the web-based tool ADAM, Analysis of Dynamic Algebraic Models [15 , a tool to study 
the dynamics of a wide range of discrete models. ADAM provides efficient analysis methods based on 
mathematical algorithms as a web-based tool for several different input formats, and it makes analysis of 
complex models accessible to a larger community, as it is platform independent as a web-service and does 
not require understanding of the underlying mathematics. ADAM is the successor to DVD, Discrete 
Visualizcr of Dynamics [16] , a tool to visualize the temporal evolution of small polynomial dynamical 
systems. 

Different types of discrete models, including (probabilistic) Boolean networks, logical networks, Petri nets, 
cellular automata, and agent-based models, can be converted into the unifying framework of algebraic 



models, namely polynomial dynamical systems 17 18 . This allows to apply tools from computational 
commutative algebra to analyze the models more efficiently than by simulation and without using heuristic 
methods. We used ADAM on several logical models and on published Boolean models with up to 60 



variables 19-21 . These models are too large for a straight-forward analysis by exhaustive enumeration of 
the state space, and the corresponding publications contain lengthy explanations and supplementary 
material that outline the calculations and algorithms used to identify the attractors. ADAM greatly 
simplifies the analysis; it identified the steady states of these models in less than one second. We believe 
that complex discrete models will gain more popularity, if sophisticated analysis methods are easily 
accessible to modelers. 

In addition to giving access to mathematical theory for efficient analysis, algebraic models provide a 
unifying framework and systematic approach for several model types, which allows for an effective 
comparison of heterogeneous models, such as a Boolean network model and an agent-based model. For 
community integration to the biological sciences, ADAM contains a model repository of previously 



published models available in ADAM specific format 22 . This allows new users to familiarize themselves 
quickly with ADAM and to validate and experiment with existing models. In the following section, we 
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discuss general features of ADAM briefly and explain new features in more detail. 



Results and Discussion 

A wide range of software and algorithms exist to analyze discrete models. These tools are either limited by 
complexity as they rely on simulation as analysis method, or they are inaccessible to biologists not familiar 
with programming, as they are often times only available as platform dependent command-line tools. 
Furthermore, implementations require different input formats, which hampers the use of different software 
tools on the same model. 

ADAM is an analysis tool for discrete models, available as a web-based tool, which hides and encapsulates 
the mathematical algorithms from the user. Therefore, users who lack understanding of the underlying 
mathematics or programming expertise can use efficient algorithms to analyze complex models. 
We propose a novel method to identify attractors of a discrete model. This method relies on the fact that 
many discrete models can be translated into the algebraic framework of polynomial dynamical systems. 
Using these polynomials, one can construct a system of polynomial equations, such that its solutions 
correspond to fixed points or limit cycles. Thus, the problem of identifying attractors becomes equivalent 
to solving a system of polynomial equations over a finite field. This is a long-studied problem in computer 
algebra, and can usually be solved efficiently by computing a Grobner basis. 

This method is not a new mathematical algorithm to solve polynomial dynamical systems, but a novel 
approach that uses the fact that attractors are the solutions of polynomial systems derived from the model 
when expressed in the algebraic framework. ADAM allows users unfamiliar with polynomial dynamical 
systems or Grobner basis to benefit from this efhcient algorithm. 



General Features 

ADAM automatically converts discrete models into polynomial dynamical systems, that is time and state 



discrete dynamical systems described by polynomials over a finite field (see Appendix A.l for definition 
and example). The dynamics of the models are then analyzed by using various computational algebra 
techniques. Even for large systems, ADAM computes key dynamic features, such as steady states, in a 
matter of seconds. ADAM is available online and free of charge. It is platform independent and does not 
require the installation of software or a computer algebra system. 

ADAM can analyze discrete models. It translates the following inputs into (probabilistic) polynomial 
dynamical systems and can then analyze all of them except models originating from Petri nets. 
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• Logical models generated with GINsim [t] 

• Petri nets generated with Snoopy [o] 

• polynomial dynamical systems 

• Boolean networks 

• probabilistic polynomial dynamical systems, probabilistic Boolean networks (PBN) [s]. 
We plan to implement analysis methods for Petri nets in future versions. 

ADAM's main application is the analysis of the dynamic features of a model, which includes the 
identification of stable attractors. These are either steady states, i.e., time-invariant states, or limit cycles, 
i.e., time-invariant sets of states. ADAM is capable of identifying all steady states and limit cycles of 
length up to a user-specified length m. The process of finding long limit cycles is quite slow for large 
models, however, in biological models limit cycles are likely to be short, so that m can be chosen to be 
small in general, i.e., less than ten. 

The temporal evolution of the model can be visualized by the phase space, a graph of all possible states 
and their transitions, also called state space or state transition graph. For small enough models, i.e., less 
than eleven variables, ADAM generates a graph of the complete phase space; for larger models, ADAM 
uses algebraic algorithms to determine dynamic properties. Independent of network size, ADAM generates 
a wiring diagram. The wiring diagram, also known as dependency graph, shows the static relationship 
between the variables. All edges in ADAM's wiring diagrams are functional edges, that is there exists at 
least one state such that a change in the input variable causes a change in the output variable (see 



Appendix A. 2 for more details). This means that ADAM determines all non-functional edges, which is 
oftentimes of interest. 

With ADAM, one can also study the temporal evolution of user-specified initial states. The trajectory of a 
state describes the state's evolution, and it can be computed by repeatedly applying the transition function 
until an attractor is reached. 

All of these features can be computed assuming synchronous updates or sequential updates according to an 
update-schedule specified by the user. Note that the steady states are the same independent of the update 
schedule. This is due to the fact that updating any variable at a steady state does not change its value. It 
is irrelevant for a steady state analysis whether updates are considered to happen sequentially or 
simultaneously. 
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For probabilistic networks, i.e., models in which each variable has several choices of local update rules, 
ADAM can generate a graph of all possible updates. This means that states in the phase space can have 
out-degree greater than one, since different transitions are possible. ADAM can find all true steady states, 
in the context of probabilistic networks, meaning all states that are time-invariant independent of the 
choice of update function. For further information of probabilistic networks, see [s]. 



For Boolean networks, ADAM calculates all functional circuits (see Appendix A.2|. Positive functional 
circuits are a necessary condition for multi-stationarity. For a certain class of Boolean networks, namely 
conjunctive/disjunctive networks, ADAM computes a complete description of the phase space as described 



in [23] . For further details on conjunctive networks, see Appendix B.2 
In summary, ADAM can generate the following outputs. 

• wiring diagram 

• phase space for small models 

• steady states (for deterministic and probabilistic systems) 

• limit cycles of specified length m 

• trajectories originating from a given initial state until a stable attractor is found 

• dynamics for synchronous or sequential updates 

• functional circuits for Boolean networks 

• a complete description of the phase space for conjunctive/disjunctive networks. 



Comparison to Other Systems 

In this section, we compare ADAM to other state-of-the-art systems. Most software tools discussed here 
provide different functionality than ADAM does, thus a run-time comparison is not feasible. An overview 
of the tools and their functionality is given in table [T] and they are discussed in detail below. 
GINsim (Gene Interaction Network simulation) is a package designed for the analysis of gene regulatory 
networks 0. As input, it accepts logical models. Logical models are an extension to Boolean models; they 
consist of similar switch-like rules, but allow for a finer discretization with more than two states per 
variable, e.g., low, medium, and high. Logical models can be updated synchronously or asynchronously. 
For the latter, the temporal evolution of a logical model is non-deterministic because the variables are 
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Steady States 


Limit Cycles 


Input 


Requirements 


ADAM 


Yes* 


Yes* 


Boolean (or polynomial) functions 
Logical Models (GINsim) 
Petri Net (Snoopy) 


None, web based 


GINsim 


Yes* 


for asynchronous networks* 


parameters (non-zero truth tables) 
Logical Model 


Java virtual machine" 


BoolNct R package 


< 29* 


< 29* 


Boolean functions 


R statistics software 


Snoopy 


reachable from a given initial state 


SBML 
P/T graph 


o 


DDLab 


< 31 


< 31 


truth tables 


o 


GenYsis-P 
Toolbox 


Yes® 


GcnYsis-P specific text file 


Linux that supports the 
distributed binaries 


BN/PBN Matlab 
Toolbox 


< 27 


< 27 


truth tables 


Matlab 



Table 1: Comparison of different software tools regarding attractor analysis: | less than 1 second on 
published gene regulatory networks with up to 72 variables; o only for short limit cycles; f heuristic methods 
are available for larger networks; * asynchronous updates in the sense of logical models; (8) we had no 
platform available that supported the distributed binaries and could not conduct benchmark calculations; o 
installation necessary, available for common operating systems. 



updated randomly in an asynchronous fashion. In either case, updates of every variable are continuous, 
meaning that no variable changes its value by more than one unit in one time-step, see section Remarks 
about Logical Models for a detailed discussion. 

GINsim provides algorithms that use binary decision diagrams (BDD) for the determination of steady 
states and oscillatory behavior |4 . For synchronous updates, analysis of limit cycles is only possible by 
simulating every trajectory, i.e., generating the complete state space, called state transition graph in 
GINsim, and therefore limited by network size. We tested GINsim on logical models with up to 72 
variables; determining the steady states took less than one second. More complex logical networks were not 
available to us. 

Networks are entered manually into GINsim, they cannot be imported from any other format. 
Furthermore, models are specified by entering their parameters, i.e., entering all values that result in a 
non-zero target value. Especially for large models, this can be a time consuming process. 
BoolNet R package provides methods for inference and analysis of synchronous, asynchronous, and 
probabilistic Boolean networks jSj. It is a package for the free statistics software R, and it is run via the R 
command-line. It is helpful, if the user is already familiar with R. Steady state analysis is implemented as 
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exhaustive search of the state space, heuristic search, random walk, or Markov Chain analysis [sj. 
Non-heuristic analysis is limited to networks with 29 variables. For larger networks, steady states can be 
inferred heuristically, which does not guarantee that all steady states are identified. 
Snoopy is a unifying Petri net framework, containing a family of Petri net modeling tools and 
algorithms 1 9 . Snoopy provides built-in simulation and animation. Analysis of Petri nets can be 



performed, e.g., with the tool Charlie 24 . Charlie identifies structural properties and has algorithms for 
invariant based or reachability graph based analysis. Reachability graph-based analysis for Petri nets 
usually depends on a given initial state and does not provide a complete picture of possible dynamics for 
other initial markings. In addition to marking-dependent analysis, Charlie uses algorithms based on linear 
algebra to predict the dynamic properties independent of markings, such as T and P-invariants. ADAM 
converts a Petri net to a collection of polynomial dynamical systems, one system for each transition. 
Analysis of such a non-deterministic system is currently not implemented in ADAM, and we therefore do 
not list any run-time results for Petri nets. 

DDLab is an interactive graphics software for discrete models, including cellular automata. Boolean and 
multi- valued networks [lOj. As it is mainly a visualization tool, analysis is based on exhaustive 
enumeration of the state space, and model size is limited to 31 variables. 

Gen Ysis-P Toolbox is a command- line tool currently available only for Linux to analyze (probabilistic) gene 
regulatory networks. Algorithms use (reduced order) binary decision diagrams. As analysis methods are 
not based on exhaustive enumeration, GenYsis-P can analyze large networks. Unfortunately, we did not 
have access to a platform that supports the distributed binaries, and source code was not available. 
BN/PBN Toolbox is a toolbox written in Matlab |11 . It uses the state transition matrix to compute 
attractors. Statistics for networks with more than 27 variables cannot be computed ("Maximum variable 
size allowed by the program is exceeded"). In addition to analyzing deterministic Boolean networks, the 
toolbox can analyze probabilistic Boolean networks and calculate statistics such as numbers and sizes of 
attractors, basins, transient lengths, Derrida curves, percolation on 2-D lattices, and infiucncc matrices. 
ADAM calculates all steady states of networks non-heuristically, by applying algebraic algorithms, see 
section Methods. All of the above software tools provide other functionality that ADAM currently lacks, 
but for the analysis of synchronous networks, they all are either restricted to less than 32 variables or 
require familiarity with programming. GINsim calculates steady states for large networks, but models must 
be specified by their parameters and cannot be entered in a compact form as such (Boolean) fimctions. 
Several Boolean models too large for analysis by exhaustive enumeration have been published, for example 
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on the expression of the segment polarity genes in Drosophila melanogaster or T-ceU regulation 20 21 . 
ADAM identified all steady states and small limit cycles for these systems in less than one second, whereas 
the publications that comprise the Boolean models, contain lengthy explanations outlining the analysis. 



The model files in ADAM format can be accessed at 22 



Remarks about Logical Models 

ADAM allows for synchronous or sequential updates according to a given update schedule. In models with 
synchronous updates, all variables are updated simultaneously at every time step. In models with 
sequential updates, all variables are updated at every time-step, but in the order of the given update 
schedule. Models with sequential updates can be converted into synchronous models with identical state 
space. In models with asynchronous updates, as it is common for logical models, one variable is updated at 
random at every time step, which results in a non-deterministic model. Sequential and asynchronous 
updates of the same system result in different dynamics. 

In GINsim, all models are "continuous" in the sense that at each time-step, each variable increases or 
decreases by at most one unit. Though logical models are discrete, there are no jumps skipping 
intermediate states. For example, in a model with three states, low, medium, and high, no variable can 
drop from high to low in a single update step. This interpretation is different from the common meaning of 
continuous, which usually refers to models of ordinary or partial differential equations. The parameters 
entered in GINsim specify the target value towards which the variable changes, i.e., the value increases by 
one, decreases by one, or remains constant if the target value is larger, smaller, or equal than the initial 
value, respectively. The phase space generated with ADAM might differ from the state transition graph 
generated in GINsim. To obtain the exact same phase space, every variable in the logical model must 
contain an explicit self-loop, and all parameters must be entered such that the target value differs by at 
most one from the value of the variable to be updated. Any logical model can be specified in this way 
without changing its state transition graph. Boolean models are always continuous. 
In multi-valued logical models, variables can have different maximum values. In an algebraic model, all 
variables are defined over the same algebraic field, i.e., have the same maximum value. When a 
multi-valued logical model is translated into an algebraic model, extraneous states might be introduced 
such that all variables are defined over the same field. An example of such an extension is given in table [2] 
the extra states are the states in the last row, which are given the same values as the states above to 
extend the model in a meaningful way. The extra states should be ignored when analyzing the dynamics. 
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next state of X2 


low X2 


medium X2 


high X2 


Xi absent 


low X2 


medium X2 


high X2 


xi present 


medium X2 


high X2 


high X2 


extension xi present 


medium X2 


high X2 


high X2 



Table 2: Updates for variable X2 in a logical model, where X2 depends on xi and itself. The states and 1 
represent absent and present for the Boolean variable xi; 0, 1, and 2 represent low, medium, and high for 
the multi- valued variable X2- The last row is introduced in the polynomial dynamical system such that all 
variables are defined over F3. The extra states (2, 0), (2, 1), (2, 2) in the state space should be ignored when 
interpreting the dynamics. 



For more details, see 17 . 



Remarks about Petri Nets 

In the Petri net community, state space usually refers to all states (markings) reachable for a given initial 
state. In this manuscript, state space refers to the set of all possible states, independent of an initial state. 
Translating a bounded Petri net to an algebraic model results in a set of polynomial dynamical systems, 
where every transition corresponds to one system. In a Petri net, different firing sequences can lead to 
different markings; a firing sequence relates to the order in which the different systems are iterated. The 
update schedule is not related to the firing sequence. As firing does not consume any time, polynomial 
systems describing a Petri net always use synchronous updates [25] . 

The term functional edge is not related to the concept of liveness of a transition. The liveness of a 
transition depends on an initial marking. An edge, connecting two places (source and target), is functional, 
if there exists a marking, such that changing only the marking of the source place, changes the marking of 



the target place (see Appendix A. 2 for more details on functional edges). 



Application 

We show how to use ADAM on a well-understood model of the expression pattern of the segment polarity 
genes in Drosophila melanogaster. Albert and Othmer developed a model for embryonic pattern formation 
in the fruit fly Drosophila melanogaster |^20|. Their Boolean model consists of 60 variables, resulting in a 
state space with more than 10^^ states. They analyze the model for steady states by manually solving a 
system of Boolean equations. They also analyze the temporal evolution of a specific initial state 
corresponding to the wild type expression pattern by repeatedly applying the Boolean update rules until a 
steady state is found. The update schedule of the model is synchronous with the exception of activation of 
SMO and the binding of PTC to HH (activation of PH), which are assumed to happen instantaneously. 
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compartment 1 


en, EN, hh, HH, SMO 


compartment 3 


ptc, PTC, PH, SMO, ci, CI, CIA 


compartment 3 


SLP, PTC, ci, CI, CIR 


compartment 4 


SLP, wg, WG, ptc, PTC, PH, SMO, ci, CI, CIA 



Table 3: Genes and proteins present in steady state [T] 

This can be accounted for by substituting the equations for SMO and PH into the update rules for other 
genes and proteins, rather than using SMO and PH themselves. 



To analyze the model, we first rename the variables in the Boolean rules given in 20 such as wgj or SLP^ 
to xi . . .xqo, to standardize their format. Then we use ADAM: the model type is Polynomial Dynamical 
Systems, the number of states in a Boolean model is 2, representing present or absent. One can choose 
Boolean, and enter the Boolean rules in the text-area or upload a text file with the Boolean rules. 
Alternatively, one can first convert the Boolean rules to polynomials over F2, and enter the polynomials 



with the choice Polynomial. The file with the polynomial equations for the model can be accessed at 22 . 
The rules in the model file are specified in Polynomial form. Once the polynomials are uploaded, we need 
to set the Analysis type. The model with 60 variables is too complex for exhaustive enumeration, and we 
choose Algorithm. This means that instead of exhaustive enumeration of the state space, analysis of the 
dynamics is done via computer algebra by solving systems of equations. In Options, we set Limit cycle 
length to one because we are interested in the steady states, i.e., time- invariant states. We chose 
.synchronous as updating scheme. Once these choices have been made, we obtain the steady states by 
clicking Analyze. ADAM returns a link to the wiring diagram or dependency graph, which captures the 
static relations between the different variables. In addition, ADAM returns the number of steady states 
and the steady states themselves, see figure [T] These steady states are identical to those found in 20 1, half 
of which have been observed experimentally. 

Each row in the table corresponds to a stable attractor. Attractors are written as binary strings, where 
represents non-expression of a gene (or low concentration of a protein), and 1 expression (or high 
concentration), e.g., 

000111100010000000000011111110100000001001101111000011111110 (1) 

corresponds to the genes (and proteins) being expressed ( or present in high concentration) in four cells 
form anterior to posterior compartments (compartment 1 to 4) as shown in table [3] This is the steady 



state obtained in 20 when starting the system with an initial state representing the experimental 



observations of stage 8 embryos. 
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Pick to view the dependency graph. 
Running analysis now ... 

There are 10 limit cycles of length 1 and they are: 



000000001001101000000001001 lOllOOOOOOOlOOllOLlOOOOOOOlOOllOl 



000111100010000000111100010000111000011111110111000011111110 



00000001111 11100001 111000100001 llOOOOil 111110100000001001101 



011000011111110000111100010000111000011111110100000001001101 



000000011111110000111101000000111000011111110100000001001101 



011000011111110000111101000000111000011111110100000001001101 



000111100010000000000011111110100000001001101111000011111110 



000111101000000000000011111110100000001001101111000011111110 



000111100010000011000011111110100000001001101111000011111110 



000111101000000011000011111110100000001001101111000011111110 



Figure 1: ADAM: Analysis of steady states of Drosophila model. Each row in the table corresponds to a 
stable attractor. Attractors are written as binary strings, where represents non-expression of a gene (or 
low concentration of a protein), and 1 expression (or high concentration) 

ADAM can also generate trajectories for a given initial state. For example, we can choose the initial state 
that was used in [20| representing stage 8 embryos. Again, we enter Polynomial Dynamical Systems with 2 
as the number of states and upload the polynomials describing the model. Instead of Algorithms, we now 
choose Simulation. Since we are not interested in the number of steady states or the complete phase space, 
but in a single trajectory originating from a specific initial state, we choose One trajectory starting at an 
initial state as the simulation option. As initial state we enter 

000101000000000000000010001000100000010001000110000010001000. 

By clicking Analyze, we obtain the temporal evolution of this particular state until it reaches a steady 
state. As predicted in 20 , the steady state is the state described in table [s] see Fig. [2j 



To summarize, ADAM correctly identified the steady states in less than one second. All steady states have 



been determined previously in 20 by labor-intensive manual investigation of the system. 
Furthermore, we used ADAM to verify that there are no limit cycles of length two or three. The model has 
not been analyzed previously for limit cycles. The absence of two- and three cycles strengthens confidence 
in the model, since oscillatory behavior has not been observed experimentally. Computations for limit 
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Initial State 


000101000000000000000010001000100000010001000110000010001000 


after 1 iteration 


000010100011000000000001011100100000001011100111000001011100 


after 2 iteration 


000101000010100000000000111110100000001001101111000000111110 


after 3 iteration 


000110100011010000000010011110100000001001101111000010011110 


after 4 iteration 


000111000010100000000011011110100000001001101111000011011110 


after 5 iteration 


000111100010010000000011001110100000001001101111000011001110 


6 - Fixed Point 


000111100010000000000011111110100000001001101111000011111110 



Figure 2: ADAM: Trajectory of Drosophila model 
cycles of length greater than three have not been conducted, as composing the system several times with 



itself is computationally complex. The model file in ADAM format can be accessed at 22 . 



Benchmark Calculations 



We analyzed logical models available in the GINsim model repository 19 as of August 2010. The 
repository consists of models in GINsim XML format previously published in peer-reviewed journals. We 
converted all but two models into polynomial dynamics systems. For these 26 models we computed the 
steady states. All calculations finished in less than 1,5 seconds, see Figure [3j 



In addition to the published models in 19 , we analyzed randomly generated networks that have the same 
structure that we expect from biological systems, namely sparse, i.e., while the number of nodes in a 
biological network may be quite large, each node is affected only by a small number of other nodes, and 
robust, i.e., small number of attractors.. We tested a total of 50 networks with 50-150 nodes (10^^ — 10^^ 
states) and an average of average in-degrees of 1.6848. The steady state calculations took less than half a 
second for each network on a 2.7 GHz computer. 

Architecture 

ADAM is available as an web-based tool, that does not require any software installation. ADAM's user 
interface is implemented in HTML. We use JavaScript to generate a dynamic website that adapts as the 
user makes various choices. This simplifies the process of entering a model. For example, after defining the 
model type, i.e.. Polynomial Dynamical System, Probabilistic Network, Petri net, and Logical Model the 
next line changes to the number of states, fc-bound, or nothing, appropriately. Input can be entered 
directly into the text-area, or uploaded as a text document. 



All mathematical algorithms are programmed in Macaulay2 26 . Macaulay2 is a powerful computer 



algebra system. The routines for which fast execution is crucial are implemented in C/CH — h as part of the 
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Number of States 



Figure 3: Runtime of steady state calculations of several logical models from 19 . Executed on a 2.7 GHz 
computer. 

Macaulay2 core. Logical Models and Petri nets in XML format are parsed using Ruby's XmlSimple library. 
The interplay between HTML and Macaulay2 is also programmed in Ruby. 

Output graphs are generated with Graphviz's dot command. When Simulation is chosen as analysis 
method, Graphviz's ccomps - connected components filter for graphs is used to count the connected 
components. A Perl script directs the execution of the Graphviz commands. 



Model Repository 

A model repository is part of the ADAM website . The repository consists of a collection of several 
previously published models in ADAM format. The models are extracted from publications, and rewritten 
in ADAM specific format, i.e., all variables are renamed to and the update rules from the original 
publication are reformulated as Boolean rules or polynomials. The central repository with models in a 
unified framework allows for quick verification and experimentation with published models. By changing 
parameters or initial states, users can gain a better understanding of the models. 

New users can also use the repository to quickly familiarize themselves with the main functionalities of 
ADAM. In addition to the model itself, the database entries contain a short summary of the biological 
system and relevant graphs, together with an analysis of dynamic features determined by ADAM and their 
biological explanation. The repository is work in progress by researchers from several institutions 
generating more entries for the repository. We invite all interested researchers to submit their models. 
Because of their intuitive nature, discrete models are an excellent introduction to mathematical modeling 
for students of the life sciences. ADAM's model repository is a great starting point to familiarize students 
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with the abstraction of discrete models such as Boolean networks. 

Conclusions 

Discrete modeling techniques are a useful tool for analyzing complex biological systems and there is a need 
in the biological community for easily accessible analysis software. ADAM provides efficient methods as a 
web-based tool and will allow a larger community to use complex modeling techniques, as it is platform 
independent and does not require the user to understand the underlying mathematics. Upon translating 
discrete models, such as logical networks, Petri nets, or agent-based models into algebraic models, rich 
mathematical theory becomes available for model analysis. 

After extensive experimentation with both discrete models arising in systems biology and randomly 
generated networks, we found that the algebraic algorithms presented in this manuscript are fast for sparse 
systems with few attractors, a structure maintained by most biological systems. All algorithms have been 
included in the software package ADAM 15 1, which is user- friendly and available as a free web-based tool. 
ADAM is highly suitable to be used in a classroom as a first introduction to discrete models because 
students can use it without going through an installation process. 

ADAM provides methods to analyze the key dynamic features, such as steady states and limit cycles, for 
large-scale (probabilistic) Boolean networks and logical models. ADAM unifies different modeling types by 
providing analysis methods for all of them and thus can be used by a larger community. 
We hope to expand ADAM to a more comprehensive Discrete Toolkit which incorporates new analysis 
methods, better visualization, and automatic conversion for more model types. We also hope to analyze 
controlled algebraic models and expand theory to stochastic systems. 

Methods 

Logical models, Petri nets, and Boolean networks are converted automatically into the corresponding 
polynomial dynamical system as described in [T? , so that algorithms from computational algebra can be 
used to analyze the dynamics. In polynomial dynamical systems over a finite field, states of a variable are 
assigned to values in the field, and the update (or transition) rule for each variable is given as a polynomial 
rather than a Boolean or logical expression. For more details, see appendix |A.1[ Using these polynomials, 
one can construct systems of polynomial equations, such that their solutions correspond to fixed points or 
limit cycles. Thus, the problem of identifying attractors becomes equivalent to solving a system of 
polynomial equations over a finite field. This is a long-studied problem in computer algebra, and can 
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usually be solved efficiently by computing a Grobner basis. 

Grobner basis calculation is for polynomial systems what Gauss- Jordan elimination is for linear systems: a 
structured way to transform the original system to triangular shape without changing its solution space. 
The triangular shape of the resulting systems allows for stepwise retrieval of the solutions of the system. 



For a more in depth discussion of Grobner bases, see for example 27 



In the worst case, computing Grobner bases for a set of polynomials has complexity doubly exponential in 
the number of solutions to the system. However, in practice, Grobner bases are computable in a reasonable 
time. It has been suggested, that in robust gene regulatory networks genes are regulated by only a handful 



of regulators 28 . Thus, the polynomial dynamical systems representing such biological networks are 
sparse, i.e., each function depends only on a small subset of the model variables. From our experience, a 
Grobner basis calculation for sparse systems with few attractors, a structure common for biological 
systems, is actually quite fast. 



A Mathematical Background 
A.l Polynomial Dynamical Systems 

To be self-contained, we briefly explain polynomial dynamical systems and their key features. 
A polynomial dynamical system (PDS) [29] over a finite field /c is a function 

/ = (/!,...,/„) :fc"^fc", 

with coordinate functions fi € k[xi, . . . ,Xn]- Iteration of / results in a time-discrete dynamical system. A 
PDS can be used to describe the dynamic behavior of a biological system: every variable Xi corresponds to 
a biological substrate, for example a protein or gene, and the polynomials fi describe the evolution of Xi 
depending on the previous state of the variables xi, . . . ,Xn- 

PDS have several dynamic features of biological relevance. These include the number of components, 
component sizes, steady states, limit cycles, and limit cycle lengths. 

Example Let fc = F2 and / = (A, A, /a) : ^ with 

fl = X1X2X3 + X1X2 + X2X3 + X2 

f2 = X1X2X3 + X1X2 + X1X3 +X1+X2 

/s = X1X2X3 + X1X3 + X2X3 -I- a;i -I- X2- 

The wiring diagram of /, which shows the static interaction of the three variables, is depicted in Figure |4] 
(left) along with its phase space in Figure [4] (right). The phase space shows the temporal evolution of the 
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system. Each state is represented as a vector of the values of the three variables {xi,X2, X3). The PDS 
described by / has two stable attractors: a steady state, (000), and a limit cycle of length three, consisting 
of the states (010), (111), and (Oil). 




Figure 4: (left) Wiring diagram: static relationship between variables (right) Phase space: temporal evolution 
of the system 

A probabilistic PDS over a finite field fc is a collection of functions 

/ = ({/i,i, . . . , /i,, J, . . . , . . . , /„,,„}) : k" ^ k\ 

together with a probability distribution for every coordinate that assigns the probability that a specific 
function is chosen to update that coordinate. The coordinate functions fij are elements in k[xi, . . . ,a;„]. 
Probabilistic PDS, specifically Boolean probabilistic networks (PBN), have been studied extensively in [3]. 
ADAM analyzes probabilistic PDS. It can simulate the complete phase space for small enough models, by 
generating every possible transition and labeling the edge with its probability according to the distribution. 
If no distribution is given, ADAM assumes a uniform distribution on all functions. For large networks, 
ADAM's Algorithm choice computes steady states of probabilistic networks. 

A. 2 Functional Edges 

An edge in the wiring diagram from Xi to Xj is considered functional, if there exists a state 
X = (ii, . . . , Xn) such that fj{xi, . . . , a, . . . i„) 7^ fjixi, . . . , 6, . . . where a and b are values for Xi, in 
other words, if there is at least one state, such that changing only Xi but keeping all other values fixed, 
changes the next state of xj. In ADAM, all edges in the wiring diagram are functional. For Boolean 
networks, ADAM identifies all functional elementary circuits. An elementary circuit is a finite closed paths 
in the wiring diagram where all the nodes are distinct. Functional Circuits are a necessary condition for 
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multi-stationarity and limit cycles. For a further discussion of functional circuits, see [4]. For multivalued 
networks, circuit analysis has not yet been implemented. 

B Algorithms 

B.l Analysis of stable attractors 

Every attractor in a PDS is either a steady state or a limit cycle. For small models, ADAM determines the 
complete phase space by enumeration, for large models, ADAM computes steady states and limit cycles of 
a given length. A state is a steady state, if it transitions to itself after one update of the system. A state is 
part of a limit cycle of length m, if, after m updates, it results in itself. Any steady state of a PDS satisfies 
the equation f{x) — x, as no coordinate of x is changing as it is updated. Similarly, states of a limit cycle 
of length TO satisfy the equation /"(x) — x. ADAM computes all steady states by solving the system 
fi{x) — = for i g {1, . . . , n} simultaneously. To efficiently solve the resulting systems of polynomial 
equations, we first compute the Grobner basis in lexicographic order for the ideal generated by the 
equations. By the elimination and extension theorem, choosing a lexicographic order allows to easily obtain 



the solutions 27 . We use the Grobner basis algorithms distributed with Macaulay2, a computer algebra 



system, and found that for quotient rings over a finite field the implementation 'Sugarless' is more efficient 



than the default algorithm with 'Sugar' 26 30 . For limit cycles of length to, the solutions of /™(a;) — x 



are found and then grouped into cycles, by applying / to each of the solutions. 



Example Fixed points of the system shown in the example in A.l arc solutions in F2 of the system 
f{x) = x: 

X1X2X3 + X1X2 + X2X3 +X2 = Xi 
X1X2X3 + X1X2 + XiXz + Xi + X2 = X2 

XIX2X3 + XIX3 + X2X3 + XI + X2 ^ X3. 

The only solution to this systems is the point {xi,X2,X3) — (0,0,0). This is in accordance with the state 
space depicted in figure [3} (0, 0, 0) is the only fixed point. To investigate limit cycles of length two, one has 
to look at the system /^(x) = x, 

9i{x) = /i(/i(x),/2(x),/3(a;)) = xi * 0:2 + a;2 * X3 = xi 

g2{x) = /2(/i(a;), h{x),h{x)) = xi * a;2 * ^3 + xi * X2 + xi * ^3 + Xi + a;2 = ^2 

9z{x) = /3(/l(x),/2(a;),/3(x)) ^ Xi* X2* X'i+ X2 ^ X-i. 
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Again, (0, 0, 0) is the only solution, which means that there are no limit cycles of length two. 
Investigating f^{x) — x, 

fi{gi{x),g2{x),g3{x)) = xi 
f2{gi{x),g2{x),g3{x)) = X2 

f3{gi{x),g2{x),g3{x)) = X3, 

results in the solutions (0, 0, 0), (0, 1, 0), (0, 1, 1), (1, 1, 1). (0, 0, 0) is a fixed point, and 

(0, 1, 0), (0, 1, 1), (1, 1, 1) are elements of a limit cycle of length three. For all m > 3, f™{x) — x has no 

solutions, that means the system / has exactly two attractors, a fixed point a a limit cycle of length three. 

B.2 Conjunctive/Disjunctive Networks 

Some classes of networks have a certain structure that can be exploited to achieve faster calculations. 
Jarrah et al. show that for conjunctive (disjunctive) networks key dynamic features can be found with 
almost no computational effort |23 . Conjunctive (respectively disjunctive) networks consist of functions 
using only the AND (respectively OR) operator. ADAM comes with an implementation of this algorithm 
to analyze dynamics in the case of conjunctive (disjunctive) networks. Currently, this option is only 
implemented for networks with strongly connected dependency graphs. 
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Figures 

Figure 1 - ADAM: Analysis of steady states of Drosophila model. Each row in the table corresponds 
to a stable attractor. Attractors are written as binary strings, where represents non-expression of a 
gene (or low concentration of a protein), and 1 expression (or high concentration) 

Steady states of Drosophila Melanogaster as found with ADAM. 



Figure 2 - ADAM: Trajectory of Drosophila model 

Temporal evolution of given initial state until steady state is reached. 

Figure 3 - Runtime of steady state calculations of several logical models from [17]. Executed on a 2.7 
GHz computer. 

Runtime of steady state calculations of several logical models from [17]. Executed on a 2.7 GHz computer. 
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Figure 4 - (left) Wiring diagram: static relationship between variables (right) Phase space: temporal 
evolution of the system. 

(left) Wiring diagram: static relationship between variables (right) Phase space: temporal evolution of the 
system. 

Table 1 - Comparison of different software tools regarding attractor analysis: | less than 1 second on 
published gene regulatory networks with up to 72 variables; o only for short limit cycles; f heuristic 
methods are available for larger networks; * asynchronous updates in the sense of logical models; (g) 
we had no platform available that supported the distributed binaries and could not conduct benchmark 
calculations; o installation necessary, available for common operating systems. 

Comparison of different softwares regarding attractor analysis. 

Table 2 - Updates for variable x2 in a logical model, where x2 depends on xl and itself. The states 
and 1 represent absent and present for the Boolean variable xl; 0, 1, and 2 represent low, medium, 
and high for the multi-valued variable x2. The last row is introduced in the polynomial dynamical 
system such that all variables are defined over F3. The extra states (2, 0), (2, 1), (2, 2) in the state 
space should be ignored when interpreting the dynamics. 

Extending state space when converting logical model to polynomial dynamical system. 

Table 3 - Genes and proteins present in steady state 

Steady state 
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